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Abstract 

We consider the tree-reduced path of symmetric random walk on 7L d . It is interesting to ask about the 



fvj number of turns T n in the reduced path after n steps. This question arises from inverting the signatures 

of lattice paths. We show that, when n is large, the mean and variance of T n in the asymptotic expansion 
have the same order as n, while the lower terms are O(l). We also obtain limit theorems for T n , including 
large deviations principle, central limit theorem, and invariance principle. Similar techniques apply to 

P^ ■ other finite patterns in a lattice path. 
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^ ■ 1 Introduction 

O . 

iy-> Let G be the free group with d generators ei, • • • , ea- Start with the empty word at time 0. At each time 

f/-\ ■ k, choose one from the 2d elements (d generators and their inverses) uniformly randomly to multiply the 

current word on the right. For example, the first six choices: 

e2,e3,e3 ,e2,ei , e4 

will produce the reduced word e2e2ei _1 e4 at time 6. Every word at time n has a unique reduced word 
with length at most n. It is then interesting to ask about the length and number of turns in the reduced 
"^ ■ word. 

Definition 1.1. Let w be a word, and w — ai 1 ■ ■ ■ ai k be its reduced word, where a,j is either e_, or ej 
Define the number of turns of w to be T n — #{ai,a,i +1 : a,;. 7^ a,: +1 , 1 < j < k — 1}. 

Then, two words have the same number of turns if they reduce to the same word. In the above 
example, the number of turns in the reduced word e^e^e^ 1 et, is 2. In another language, it is the number 
of times a reduced path of random walk has switched its direction. 

The main goal of this paper is to calculate asymptotics for T n when n is large. The question of 
estimating T„ arises from inverting signature for lattice paths, where at most T + 2 terms in the signature 
are needed for inversion if one knows in advance that the reduced lattic path has T turns. 



1.1 Motivation from inversion of signature for axis paths 

In this subsection, we give some background material on path-signature that motivates our study of the 
current problem. A path 7 : [s, t] — > R d is a continuous function mapping a time interval into R d . The 
length of the path is defined as 

|7| :=supd(7(ti),7(tj+i)), 

v 

where d the metric on R d , and the supremum is taken over all finite partitions of [s,i\. If I7I < +00, we 
say 7 has bounded variation. Let BV(R d ) denote the space of all paths of bounded variations in R d . 

Definition 1.2. Let 7 : [s,t] — > R d be an element in BV(R d ). The signature 0/7, X 8> t('y), is defined as: 

x., t (i) = 1 + xi, t (i) + ■■■ + x s n t ( 7 ) + ■ ■ ■ , 

where 



X^ t ( 1 )= d7(wi) ® • • • ® rf7(w„) (1) 



as an element in (R ) 



d yS : '? 



Let (ei,e2, ••• , Sd) be a standard basis of R d , then 7 can be written as (71,72,- •■ >7d). If to = 
e «i ' ' ' e i n i s a word of length n, we write 

C s ,t{w) = C s ,t{ei 1 - ■ ■ e in ) = / d7«i(«i)'"d7» n («n) 

<t s<lii<- ■ ■ <it n <£ 

as the coefficient of w. As all words of length n form a basis of V®" , we can rewrite X" t (-y) as the linear 
combination of basis elements: 

x?Ai) = E C.,t(v>)w, (2) 

|iu|— n 

where the sum is taken over all words of length n. 

Re-parametrizing the path does not change the signature. For any path a : [0, s] — ¥ V and /3 : [0, t] — > 
1/, we can form the concatenation a * ft : [0, s + t] — > V, as follows: 

a *8( u )-f «(«),« 6 [0, a] 

and similarly, the decomposition of one path into two can be carried out in the same fashion. 
For any path 7 : [s,t] — > V, the path "7 run backwards", 7 -1 , is defined as: 

7~ 1 (u) := 7(s + t — u),u G [s,t], 

and the trajectories of 7 * 7 _1 cancel out each other. 

Concatenation and "backwards" of paths of bounded variation are still paths of bounded variation. 
In fact, we have |a*/}| = \a\ + \/3\, and |7 _1 | = 7J. The following proposition, first proved by Chen ([I]), 
asserts that the signature map is a homomorphism from _BV(R d ) to the tensor algebra. 



Proposition 1.3. Let q,/3 6 BV { R d ). Then, X(a * /3) = X(a) ® X{/3). 

Hambly and Lyons (,4 ) showed that if a,/3 G BV{R d ), then X(a) = X(/3) if and only if a * /3" 1 
is tree-like, a continuous analogue of a null path. This tree-like relation defines an equivalence relation 
on _BV(R d ). Within every equivalent class, there is a unique path with minimal length, called the tree 
reduced path. An interesting question would be, given a signature X of some path of bounded variation, 
can one reconstruct the tree reduced path with the same signature XI 

For the case of axis paths, the answer was provided by Lyons and Xu in [6]. 



Definition 1.4. 7 : [s,t] — >• R d is a (finite) axis path if its movements are parallel to the Euclidean 
coordinate axes, has finitely many turns, and each straight line component has finite length. 

Any axis path has a unique reduced axis path; integer lattice paths are special cases of axis paths. 
An R d axis path can move in d different directions (up to the sign). At time 0, it starts to move along 
a direction d x for some distance n; then it turns a right angle, and moves along ei 2 for a distance ri, 
and so on, and stops after finitely many turns. Thus, up to re-parametrization, an axis path 7 can be 
represented as: 

y— (rteij * •••* (r n ei„) (3) 

where n's are real numbers, with the sign denoting the directioio 

Using Chen's identity (Proposition ll.3|) . the signature of 7 can be conveniently expressed as 

X(j) = exp(rie il ) <g> • • • <g> exp(r„e in ), 

which should be understood as the product of n power series in the letters {e^ , • • ■ , et„ }. 

If 7 is already in its reduced form, then it is clear that ik 7^ ifc+i, and we call the word w — 
(ei 1 , • • • , ej„) the shape of 7. If a word w is in its reduced form, we use \w\ to denote the number of 
letters in w, or the length of w. We introduce the notion of square free words to characterize an axis 
path. 

Definition 1.5. Let w — e.% x ■ • ■ e^ n be a word. We call it a square free word i/Vfc < n ~ 1, ik 7^ ife+i- 

The following theorem, provided by Lyons and Xu ([6]), gives an inversion procedure for finite axis 
paths. 

Theorem 1.6. For any finite axis path 7, there exists a unique square free word w with the property that 
C(w) 7^ 0, and that if w' is any other square free word with C(w') 7^ 0, then \w'\ < \w\. Furthermore, 
suppose the unique longest square free word is W — a x • • • ei n , and let 

._ 2 

Wk :— &i x • • ■ ei k _ 1 e ik ■ ei k+1 ■ e; n , 

which has length n + 1, then we have 

7 = (neij * • ■ • * (r„e< n ), 

where r k = 2§^p . 

Thus, one sees if an axis path has n turns, then at most n + 2 terms in the signature are needed for 
inversion. For a lattice path with length L, it can have at most L — 1 turns, so we only need the first 
L + 1 terms in the signature to recover it. 

In pratice, lattice paths are often generated by drawing n letters and their inverses uniformly randomly 
from an alphebet, and putting them in the order they are drawn. It is then intersting to ask about the 
number of turns in its reduced path. 



1.2 Outline of the method and summary of results 

If one writes T n = X^=i ^»i where Vi denote the number of turns created at step i. In general, Vi can be 
1, 0, or — 1, and are correlated. The distribution of Vi depends on the whole history in the past. 

On the other hand, one can condition on the length of reduced path L n . Then T n \L n has a binomial 
distribution. A detailed study of L n yields asymptotic behaviors of T n . A natural coupling L n = S n + D n 
simplifies the study of L n , where S n is a sum of n i.i.d.'s, and D n is dominated by a geometric random 
variable. 

The main results in this paper are: 

1 We mean —re, = re~ . 
J 3 



Proposition 2.5. ET n — d(2d-i) n ~ * ~ 2< d(2it-i) 1 ' var Tn d^dd-iV 1 a ' s0 conver g es - 

Theorem 3.1. (Large Deviations Principle) The sequence of the laws for the random variables 
{— -} n >i satisfy the large deviations principle with rate function 

I(x) = mp[0x -log h(0)], 

e 

where 

j-Md - l)e° + 1+ ^-\ )et + 1], 9 > log ^ 



2d-l-l 

I — -;\..'_.\(1 — I If -+- : — ■ t -i" ! . (/ ..-■■■- ID': — 

h{6) 



< log ■ 



2(d-l) 



Theorem 4.5. (Invariance Principle) _Fbr each n, define a C° ([0,1]) -valued random variable 
{Wi n) : t 6 [0, 1]} by 

/or t £ —[n], and linearly interpolated for other values oft. Then the sequence converges in law to the 
standard one dimensional Brownian motion on [0, 1] as n — > oo. 

As a generalization, analogous results hold for the number of occurrences of any finite collection of 
finite length pattern V — {Pi = (e^ , . . . , ei k . )} in a lattice path: the key is to establish central limit 
theorem for the number of occurences of elements in V conditioned on the length of the path L± , which 
is essentially an i.i.d. sum of in-dependent random variables (see [7]), where m is bounded above by 
max; ki. The number of turns T t corresponds to V = {Pij — (ej, ej) : i ^ j}. For the sake of clarity, we 
will focus only on the number of turns. 

The paper is organized as follows: 

In section 2, we show that the lower order terms in the mean and variance of L n are O(l); we 
then obtain similar results for T n . A key ingradient in the derivation is to prove cov (S n ,D n ) is O(l). 
We compare it with cov (S n +l,Dn+i), and show that their differnce decays exponentially with n, thus 
proving convergence. 

Section 3 is devoted to the proof of the large deviations principle for { — }. We derive the rate 
function, and thus prove the principle, by comparison of the Laplace transform of L n with that of S n - 
It turns out that the rate function deviates from the normal one as predicted by S n on the lower side of 
the real line. 

In section 4, we prove the central limit theorem and invariance principle for T n . This result shows 
that, although the components of T n are correlated, the increments are still asymptotically independent 
under proper scaling. 



2 Lower order terms in mean and variance 

Let L n denote the length of the reduced path after n steps. Then, T n |L n has a binomial distribution with 
parameters (L n — 1, fjjEf )■ Let Xi be a sequence of indenpedent and indentically distributed random 
variables with P(X; = 1) = 2d ~ d 1 , and P(JQ = —1) = ^. Let Lq — 0, then L n can be defined inductively 
as follows: 

Li+i — Li + Xi+i, Li > 
L i+ i = l,Li = 

We want to compare L n — S n , where S n — X/iLi-^*- ^ ^ s weu known that -^- —} =^X almost surely. We 
compute a finer estimate to show that EL n — ^f-n — 0(1). 

Since Li — Si does not change when L is away from 0, so difference only occurs when L hits 0. Let R n 
denote the number of times that L„ hits to after the first step up to time n. Since P(L ever comes back 



to 0)~ 2 d-i ' ^" convereges to a geometric distributed random varible R, with P(i? = k) — 2d-'i ( 2d-i ) k ~ 1 
(see e.g. [3]). Here, step is counted as a return, because L and 5 can be different in the first move. Let 
D n = L n — S n , then i_D n |i? n _i has a binomial distribution with parameters (R n -i, ^). fn particular, 
D n < 2R n -i. So the mean of the difference is: 

ED n = EE(D n ji? n _i) = -Ei? n _i 



d 2d(d-l) 

Thus, we get an error term for EL„: 

Lemma 2.1. Let L n denote the length of the reduced word after n steps, then 

hm (EL n — n) = 



d ' 2d(d-l) 



Now to compute the lower order terms in var L n . Since var L„ = var S n + 2cov (S„, D n ) + var D n , 
it suffices to show that cov (S n , D n ) = 0(1). We show it in the following lemma. 



Lemma 2.2. Under the above coupling L n — S n + D n , we have: 

lim {ES n D n - ES n ED n ) = -«(d), 

n— Voo 

where < u(d) < ^ 3 (^-iK^+2d-i) 

Proof. Let [/„ = (S>, — ^-^-n)D n . We show that E[/ n +i — E(7 n decays exponentially fast. 

U„+x = l {Ln=0 }(S n + X n+1 — (n + l))(D n + 1 - X n+1 ) + l {Ln>0} (S n + X n+1 — (n + l))D„ 

= U n + (X n+1 — )D n + l {Ln=Q} (l - X n+1 )(S„+i 3— (n+1)) 

a a 

Since X n +i is independent of D n , and — ro < SVi < when conditioned on L n = 0, taking expectation 
on both sides yields: 

-4(n + 1)P(L„ = 0) < EU n+1 - EU n < 0. 

Since P(L 2 n+i = 0) = 0, and 

P(i2n = 0) < P(S 2 „ < 0) 

S ^- 4rf j l 2d j 

fc=0 

2rf-l 2d-f 
-2(d-f) 1 d 2 j 

This shows that EU n is decreasing and bounded below, and thus it has a finite limit. 

Adding up all (EU n - EU n -i) gives EU n -> -u(d), where < u(d) < 2d2 ( 2d -^>(f ) + 2d - 1 K D 

Remark 2.3. The negative correlation agrees with one's probabilistic intuition: when S n is small, the 
process L n tends to visit more times, and thus D n is likely to be large. 

Proposition 2.4. Let u(d) be the constant as in the previous lemma. Then, 

lim (var L n — n) = /3(d), 

n—>oc a 



where /3(d) = -2u(d) + &=$%£=**& . 



Proof. Since var D n — Evar (D n \Rn-\) + var E(D„|_R n _i) — > HKd-i)^ + > we nave: 

var L n — — n — 2cov (S„, D n ) + var D n — > /3(d), 

where /3(d) = -2u(d) + e^g^g±3 . n 

Combining the above estimates for L n , we then have similar estimates for T n : 

ET„=EE ( T„|L„) = f^E£„-f^ 

var T n = Evar (T n |L„) + var E(T n \L n ) 



(2d -l) 2 (2d -l) 2 (2d -l) 2 ' 

which gives the following proposition: 

Proposition 2.5. Let /3(d) 6e i/ie error ierm in far L„ as above. Then, 

.. ,_,„, 2(d-l) 2 , 2d 2 -4d+l 

n^ (ET " - d(2d^T) n) = ~ d(2d-l) ■ 

and 

2(d-l) 2 (5d-2) , 4(d-l) 2 ,. ,, 2d 2 - 4d + 1 

^J^ T " ~ d2(2dil)2 ^ = (k^ m - d(2d-l)2 - 



3 Large deviations 

The goal of this section is to prove the following large deviations theorem for — . 

Theorem 3.1. The sequence of the laws for the random variables { — } n >i satisfy the large deviations 
principle with rate function 

I(x) =sap[0x- log h(6)], 

e 

where 

We postpone the proof of this theorem to the end of the section. In light of Gartner-Ellis theorem 
(see [2] section 2.3), it suffices to show that 

lim (Ee" Tn )" =h{0) 

n— > + oo 

for every 9 £ R, and the limit h is essentially smooth. 

Note that T n \L n has a binomial distribution with parameter ( 2 ^~ 2 L n — 1) for L n > 1, and T n — 
if L n — 0, we have 

Ee eT " = P(L„ = 0) + P(Ln > 0)Ew{9) Ln - 1 , (4) 



where w{9) = ^4rf e + 1- The first term is bounded by 

n 

P(L„ = 0) < ^P^ = -fc) 



,y/2d- l. n 

and we need to estimate Ew;(6) Z/ " for large n. 

In the context below, we regard w to be a positive real number independent of 8, and study the 
asymptotics of (Ew Ln )~ asn-> +00. 

Proposition 3.2. If w > 1, then we have 

.. ,_, iB .i 2rf-l 1 1 

hm Ew ) " = w H • — . 

Proof. We compare the difference between Era 1 ™+ 1 and ¥.w Ln : 

Ew Ln+1 = El {in=0} w L "+ 1 +El {in>0} w i "+ 1 
= w¥(L n = 0) + El {i7i>0} w L " +x "+ 1 



L n 



= w¥(L n = 0) + Ew x " +1 El {i7i>0} w 

= wF{L n = 0) + Ew x » +1 Ew in - Ew x " +1 F(L n = 0) 

= (^7T^™ + h~)^ Ln + ^(v> - -)P(£« = 0) 
2d 2d w 2d w 

Let a; n = Ew in ,a = 2 ^~ 1 w + ^^,& = ^(w — — ),p n — V(L n = 0), we have the following recursive 
relation: 

x„ = ax„-! + bpn-i 

Since xi = w, adding them up yields: 

n— 1 1 7 / n — 2 . n — 3 , , , \ 

x„ = a w + b{a pi + a p- 2 -\ +ap n -2+Pn-i) 

For w > 1, we have 6 > 0. In this case, since a n ~ < x n < na n ~ 1 from the expression above, we get: 

,. , ,1 2d-l 11 

hm (xn) n =a= - w + — — , 
n-1-00 2d 2d w 

thus proving the proposition. □ 

The situation for w G (0, 1) is more involved. We prove it based on comparison with (Ero ")». Note 
that S„ is a sum of i.i.d., so by Cramer's theorem, it satisfies large deviations principle with rate function 

J(x) = sup[6»x - log( 2d e e + 2^e"")]. 



Lemma 3.3. For any w > 0, the equation 



2d 2d w 

(2d-l)w 2 -l 



2d-l 1 1 , r . 

w + ^ ■ ~ (5) 



has a unique solution at a* = ) 2d _-i\ -±+-\ ■ Furthermore, a* is the global maximizer for 



// \ a —J(a) 

w (a) = w e ; . 



Proof. We first give an expression of J in terms of a only. It is clear that J(a) = +00 for \a\ > 1. For 
a G [—1,1], the maximizer 6* is 

r(a) = i[logi±^-log(2d-l)],ae[-l,l], 
2 1 — a 

passing to the limit ±00 for a — ±1. Substituting into J, we have 



for a G [— 1, 1]. Differentiating with respect to a, we obtain 

j'( a ) = I[logi±^-log(2d-l)] (6) 

for a G (— 1, 1). Note that 

f w (a) = ^(™ ae ~ J(Q) ) = ^ Q e- J ( Q »(log^ - J' (a)), 

and since J is convex, f w has the global maximizer a* satisfying 

J (a*) = log w. 

By (JSJ, solving the above first order condition yieds 

«_ (2d- l)w 2 - 1 
Q ~ (2d-l)iu 2 + l' 

and thus 

, , *s 2d-l 1 1 

Since a* is the global maximier of f w , we conclude that equation ([SJ has a unique solution at a* . □ 
Proposition 3.4. £ef a* = a*(w) — ^ ~ j™^" 6e as above, then 



lim lim (Ew t, "l / s„ = , , » , ,,) — "" , ^ w + -— - — 



5„, 2d-l . 1 1 

{% l S(a*-e,c**+e)}> 1 



This proposition shows that the major contribution for Era ™ are from the S n 's with values near a*n. 
Lemma 3.5. (a) Let a > 0. Ve G (0, a), V<5 > 0, 3N = JV(a, e, 5) sucft i/iaf 

P(L„-S„<*n|— e(a-e,o + c))> J 

n 2 

/or a?/ n> N. 

(b) Ve, 5 > 0, 3N = AT(e, 5) sucfc tftaf 

P(L„ < <5n|S ?l < -en) > i 
/or all n> N. 

Proof. We prove part(a), and the proof for part (b) is similar. Observe that 

D n = L„ — S n — 2j min Sk\, 

0<k<n 

we first consider the quantity P(mini<fe<„ St, > — o~n -^ = an), where without loss of generality, we have 
assumed ^-^-n is an integer, and have replaced | by 8. Once conditioned on the event {^- — an}, all 

possible paths contain ( 1+a ) positive movements and I x _ a ) negative movements. Since all these 



paths have the same (conditional) weight, the quantity P(mini<fc< n Sk > — 5n\— — an) is independent 
of d. Thus, we may assume d — 1, where all paths are the trajectories of the (conditional) simple 
symmetric random walk. That is, 

P( min Sk < —5n\S n — an) = P( min Sk < —Sn\S n — an), 

()<k<n 0<k<n 

where S n is a one-dimensional simple symmetric random walk. By reflection principle, we have 
P( min S k < ~8n, S n = an) = P(S n = -[(a + 28) J n). 

0<k<n 

Using Stirling's approximation, we estimate the ratio 



¥(S„ = - [{a + 2S)n)\ _ ( n \ ,( n 



(i=^») ' Wn) 



¥(S n = an) 

1 -a . .,1 + a .,.,,1 - c 
= (— g— ")K— j— n ) ! /[( 

(l-a) 1 -°'(l + a) 1+a 



,1 — a ,,,1 + a .. ...1 — a — 2<5 ...l + a + 2<5 . 
= (— g— ")K— j— n ) ! /[( g ^ ^ 2 ^ 



L (l-a-2<5) 1 -«- 2 ' s (l + a + 2<5) 1 + Q + 2 ' 5J ' 

ft is straightforward to check that (1 — x) log(l — x) + (1 + a) log(l + x) is increasing in (0, 1), thus 
the quantity in the bracket in the last line is strictly less than one, and the ratio decays exponentially. 
So, we have 

P( min S k < -8n\ — G {a - e, a + e)) 

0<fc<n n 

Y P( min 5 fc < -fci, — G (a - e, a + e))/P( — G (a - e, a + e)) 
* — ' i<fc<n n n 

<n sup P(S„ = -(/3 + 25)n)/P(— G (a-e,a + e)) 

/3e(a-£,a + e) n 

and consequently 

P( min > -<Sra|— G (a-e,a + e)) ->• 1 

0<fc<n n 



asn-> +oo, thus proving (a). The proof for part (b) is essentially the same, and we omit it here. 



Proposition 3.6. For w G (0, 1), the limit (Ew n ) « exists as n — > +oo, and we have 



U 



-'-^ + ^.1, »G(-^,1) 



V /TTT' L„\- ! 2d 2d "UJ ' 



Proo/ . i. w e (_!_ 1}i Q * = iH^il^i > 0. 
In this case, we have 

(Ew L ")i > [E( TO L "l {L7i _ s „< in} |— G (a*-e,a*+e))P(— G (a* - e, a* + e))]* 

n n 

> w a '+*+ s W(^ G (a* - e, a* + e)) »P(L„ - S„ < <5n| — G (<** - e, «* + e)) • 
Taking n — ► +oo on both sides yields 

liminf(Ew ")™ >iu° 

n— » + oo 



Since e and 8 are arbitrary, we have 



liminffEw 1 '")* > w a *e- I(at) = — -to + — • -. 

n^+oo x 2d 2d w 



On the other hand, Ew L " < Etu " for all w < 1, so 



Thus, we conclude that 



.. ^ i 2rf-l 1 1 

hmsup(Eit) ")» < — — -— w; + — • — . 
n^+oo 2d 2d to 



hni (iLw I " = ; — iy H ; • — 

n->+oo 2d 2d w 



for M £(^EI,l). 

2. to G (0, ^5=), and a* < 0. 
In this case, we have 

(Ew L ")i > \E(w L "l {Ln < Sn} \S n < -en)P(S„ < -en)]' 
> ™*P(L n < <5n|S„ < -en)«P(5„ < -en)*. 
Again, sending n — > +oo and take e, <5 arbitrarily small yields 



liminf(Eu) ")~ > 



n — >-\-oo 



On the other hand, 



by monotonocity, we have 



and thus 



for all w G (0, — ^ 



lim lim (Eiu 

w I I n— V + oo 



d 



L„,i _ ^2d~T 



lim sup (Ew ")" < 



r X _L „ V2d~l 



n — > -+- oo 



urn sup(Eio ) " = 



n — f + >3G 



3. w — / 2 j_, , Q* = 0. Again, by monotonocity in u>, we have 



lim (EV2d-T- L ")i = ^E±. 

n—y + oo a 



a 



The following corollary is an immediate consequence of Propositions 13,21 and 13.61 

Corollary 3.7. The laws for the random variables { — }„>i satisfy the large deviations principle with 
rate function 

Il(x) = sup[9x — logfti(0)], 

6 



where 



-'-V + ie- 9 , 0>-±log(2d-l) 



h IO\ — ) 2d '2d ' — 2 

n L{?) — S V2d-1 a ^ l„„ V2d-1-1 



10 



Now we are in a position to prove Theorem 13.1 



Proof of Theorem I37TI 

Proof. Recall that 

Ee »T„ = P ( Ln = o) + P(L n > l)Ew(6') i; 
where we have 



p(l„ = o) < c( V2d d Y 

for all n. On the other hand, Propositions 13.2 1 and 13.61 imply that 



lim (Eu/" 1 )" > 
for all w > 0. Thus, we see that 

lim (Ee eT ")~ = lim {Ew(8) Ln )i = h(8), 

n — >-(-oo n— f + oo 

where h is defined in the theorem. It is also straigtforward to check that h is essentially smooth. Thus, 
Gartner- Ellis theorem implies that the laws for {-^j satisfies the large deviations principle with rate 
function 

I(x) = sup[0z-log/i(0)]. 

e 

□ 



We end this section with two remarks. 

Remark 3.8. As an alternative to Gartner-Ellis theorem, one can compute the rate function for — 
directly as follows: 

P( — G (a-e,a + e))=V>(— G (a - e, a + e)|— e 09 - 5,0 + 5)) ■ P(— e (/3-<5,/3 + 5)) 

where the sum is taken over appropriate /3's G (0, 1). In each product, the first probability is known as 
T n \L n has a binomial distribution, while the second probability can be aysmptotically computed from the 
LDP for L n . Finally, sending S — >• 0, one can get the rate function for { — }■ 

Remark 3.9. We give a heuristic explaination why the rate function of { — } takes this form. Let 
w = e v , and consider the Laplace transform 

{Ew L ")i = (Ee vL ")i. 

For w > w* = , * , the limit exists and is equal to 2 ^~ 1 e v + ^e" 1 ', where w — e v . Call the limit of 
the right hand side A(n), if it exists. One sees that rf = \ogw* is the minimizer of A, and thus 



.. . ^ V2rf-1 

for all r\ < n* . On the other hand, if A(ry) exists, it must be convex. Because L n > 0, it also must not 
(strictly) increase when r\ becomes smaller. So the convexity in n forces the graph to the left ofn* = logui* 
to be fiat, and is thus equal to I" 1 . 
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4 Central limit theorem and the invariance principle 

In this section, we derive asymptotic distribution of T n for large n, including central limit theorem and 
the invariance principle. Under the natural coupling L n = S„ + D n , D n < 2R„, where R n is (almost) 
surely bounded by a geometric random variable R. As we will be frequently using this property, we state 
it as a lemma below. 

Lemma 4.1. Let L n — S„ + D„, and R n be defined as above. Then, D„ < wR n , and R = sup n R n has 
a geometric distribution with V(R n = k) = 2d-i ' ( 2^-1 )" ■ 



Theorem 4.2. Under the above assumptions, -4=(L n — ^n) converges in distribution to N(0 
and -y=(T n — d(2 d-i) n ) converges in distribution to N(0, ^ild-ir 2 ) • 
Proof. For any x € M, 



F n (x) = V(-={L n — - n) < x) 

\/Tb (Jb 

= P( 1 {Sn -™-l±n) + ^<x) 
Jn a* Jn 



-* F(x), 

where F is the distribution function for iV(0, 2d M 1 ). The convergence in the last line follows from the 
fact that —p± — > almost surely, as R — sup n R n is almost surely finite. 



Now we compute the moment generating function for -j=(T n — | _ . n). For simplicity, let /i — 

d(2d-l) and P = 2d^T • 

= e- e ^EE(e^ T ' l \L n ) 

= e- etl ^E(l -p + pe^f^ 1 

= e -e Mv Ar Ee (i„-i)iog(i+^|+^+ (i)) 



\e 



7^ 



(d-l)^(5d-2) c 2 



The convergence in the last line follows from the law of large numbers and central limit theorem for 

1 (T _ 2(d-l) 2 

v^ ™ d(2d-i)' 



L„, and dominated convergence. This implies that -j=(T n — ^d-i) n ) conver g es m distribution to 



2(d-l)- i (5d-2) > 
d' A (2d-l)' A - 



iV(0, 

n 



Since L„ behaves very much like S„, one expects that under proper scaling, it converges to the 
standard Brownian motion. 

Proposition 4.3. Let (fl,P, .F) be a probability space affording the discrete time process {Si,i £ N}. For 
each n £ N, define a C°([0, 1]) -valued random variable {W t : t € [0, 1]} by 



d-1 , , / 2d - 1 



W t (n) = [L tn - L-±tn]/yfi 



d 2 



for t € — [n], and linearly interpolated for other values of t. Then the sequence converges in law to the 
standard one dimensional Brownian motion on [0, 1] as n — > 00. 
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Proof. This is plain in light of Lemma 3.4, as L[ tn ] — Su n \ is bounded by a geometric random variable, 
uniformly in t. □ 

More difficult is the invariance principle for T n , the number of turns at time n. We prove it in the 
next theorem. Let o 2 = ^iild-iV 1 ' then, 

Theorem 4.4. For each n, define a C°([0, 1]) -valued random variable {W t :i£ [0, f]} 6j/ 

for t £ i [n], and linearly interpolated for other values oft. Then the sequence converges in law to the 
standard one dimensional Brownian motion on [0, 1] as n — > oo. 

Proof. Recall for a sequence of processes to converge to Brownian motion, it suffices to check that their 
finite dimensional joint distributions converge to that of a Brownian motion and Prohorov tightness 
criterion (see Theorem 16.5 of [5]). We prove them in the two lemmas below. 

Lemma 4.5. Let W} be defined as above. Then, 

(<° , w£° , ■ • ■ , <"' ) -* (B tl , B t2 , ■ ■ ■ , B t „), 

where B t is a standard one- dimensional Brownian motion starting at 0. 

Proof. It suffices to show that C(W} — Wl i |W P : r < s) is asymptotically N(0,t — s) and independent 
of {Iff™ : r < s}. This is a generalization of the central limit theorem for T n , which corresponds to 
the case s = 0, and also suggests that for general s, we can compare with the case s = 0. We couple 
W} — Ws with two extremely cases, as considered below. 

Let T(.) be the standard process of number of turns, and define 

U (m,n) • -^ | [m,n] 5 

that is, the number of turns in the segment of the walk during the time interval [m,n]. Then, one 
immediately sees that 

T\tn\ — ?\ sn j < [/[|_sn|,Ltn|], 

as the former may cancel turns created before time sn. 

On the other hand, we have the reversed inequality that 

T\t n \ ~ T Ls «J > ^[L^nj.Ltnj] - 2j min S k \, 

l<fc< l(t — s)n] 

and it suffices to estimate the behavior of | mini<fc<,! Sk\- Since 

L(t-»)nJ 

I = Ml < 

l<k<n 



miu \=M)< J2 W(Sk = -M)<c C /2d 1 ) 



we see that it is bounded by a geometric random variable, and thus 

— I min S k I — >• 

an l<k<(t — s)n 

in probability. Note that f/r mi „i has the same distribution as T n _ m , therefore C(Wt — W"\ W n (0, s)) con- 
verges to iV(0, t—s), and is clearly independent of W n (0, 2). By induction, we also get the asymptitically 
independent increment property for the sequence W n . 

□ 
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Lemma 4.6. W} n satisfies Prohorov's tightness condition, i.e., 

lim lim sup E[ sup | W t (n) - WJ n) | A 1] = 

h->0 n -Hx> \t-s\<h 

Proof. Let T n :— T n — d L~_L n be the centered number of turns at step n. The idea of the proof is 
similar to Ottaviani's maximal inequality for random walk on R (see Lemma 14.8 of [5]). 

Fix e > 0. Let t G [0,1) and ft G (0, 1-t). Define r := min{fc G (tn,(t + h)n] : \T k -T [tn] \ > 2eay/n}. 
Then, 

P(|5W)»J - ?Wj | > £<x^) > P( T < (t + ft)n, |5\(«+fc)„j - 2V| < eo<Jn) 

L(t+h)nJ 

= I] p(T = fc)P(|r L(t+ fc)„j-r A |<eo-v^|T = fc) 

> P(r < n) min P(|f L(4+h)nJ - T fc | < ea^\r = fc). 

fc£(£n,(i + /i)n] 

We want to bound P(r < n) = P(maxjc |2\ — Ti tn j | > 2ea^fn). First note that when hn is large 
enough, fi (t+h)n\ — ^\*»J behaves like a Gaussian with mean and variance ha 2 n. So, there exists N(e) 
such that for all n > — jpi, we have 

P(|T L(t+ k)„j ~ f [tnj | > eo^i) < exp {-e 2 /2h). (8) 



On the other hand, |!T|(t+fc)rOJ — Tj,| < Utk,(t+h)n) + M^ hn j. According to Proposition (|2.5|l . the variance 
of the former term on the right hand side is to (t + h)n — k with an error uniformly bounded in t, ft, k 
and n. The second term is dominated by a geometric random variable, and thus has a finite variance. 
So, by Chebyshev's inequality, we have 

P(|2W)»] - T k \ < eaVn-\r = fc) > 1 - W + h)n- k + C] ^ 

Since k takes values between tn and (t + h)n, the minimum bound is achieved at k — \tn\ + 1, so we get 

- — 2h C 

min P(|T L(t+h)nJ - r fc | < eo^\r = fc) > 1 - — - -, 
ke(tn,(t+h)n] e 2 n 

where C is independent of t, ft, k and n. The last inequality is valid as long as the right hand side is 
positive, which requires ft to take small values and n to take large values. These values depend on e only. 
The last bound, together with the bound (JSJ, imply 

P( max \T k -T tn \ > 2eo-y/n~) < (1 - — ) _1 exp (-e 2 /2h). (9) 

k&(tn,(t + hn)\ e 2 n 



Since for n > -i^r we have 



c- cr- 



P( sup |W$ - W t 1 "' I > c) < P( sup [T fc - f LtnJ | > -oy/n), 

Se(0,h) k£(tn,(t + h)n) * 

and the bound on the right hand side of (|9]) is independent of t, one gets 

sup P( sup I Will - W { t n) \ > e) < (1 - ^ - -r 1 exp (-e 2 /32h) 

t€(0,l) .56(0, h) e n 

for all n > max{— jpi, -4%}- Now divide the interval (0, 1) into [ij + 1 subintervals, each with length at 
most ft. Then, |t — s\ < ft implies that either s, t are in the same subinterval, or they are in two adjacent 
ones. This observation gives 

P( sup \W$ n) -W$ n '>\>e)<(%+2)(l-^-~)- 1 exp(-e 2 /32h) (10) 

\t— s\<h ft en 

for all n > max{7V(e), ^zfr}- Since 

E[ sup \W t {n) - W s (n) \ A 1] < e + P( sup \wi n) - W^ n) \ > 6), 

|t— s\<h \t-s\<h 
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the maximal inequality (|10|) quickly gives 

lim limsupE[ sup \W t {n) - W^ n) \ A 1] < e, 

f>—*0 „— »oo |t— s|<h 

which implies Prohov's tightness condition since e is arbritary. 

□ 



Combining the above two lemmas, we prove the invariance principle. 

□ 
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